Fourth Italian Information Retrieval Workshop IIR 2013
نویسندگان
چکیده
In this paper, we present some ideas about possible directions of a new interpretation of the Okapi BM25 ranking formula. In particular, we have focused on a full bayesian approach for deriving a smoothed formula that takes into account a-priori knowledge on the probability of terms. In fact, most of the efforts in improving the BM25 were done in capturing the language model (frequencies, length, etc.) but missed the fact that the constant equal to 0.5 used as a correction factor can be one of the parameters that can be modelled in a better way. This approach has been tested on a visual data mining tool and the initial results are encouraging.
منابع مشابه
A Pluggable Work-bench for Creating Interactive IR Interfaces
Information Retrieval (IR) has benefited from standard evaluation practices and re-usable software components, that enable comparability between systems and experiments. However, Interactive IR (IIR) has had only very limited benefit from these developments, in part because experiments are still built using bespoke components and interfaces. In this paper we propose a flexible workbench for con...
متن کاملBuilding a Common Framework for IIR Evaluation
Cranfield-style evaluations standardised Information Retrieval (IR) evaluation practices, enabling the creation of programmes such as TREC, CLEF, and INEX, and long-term comparability of IR systems. However, the methodology does not translate well into the Interactive IR (IIR) domain, where the inclusion of the user into the search process and the repeated interaction between user and system cr...
متن کاملNTCIR-4: Outline of Invited Talk at CLEF 2004 Workshop
This talk will present the fourth NTCIR Workshop, which is the latest in a series of evaluation workshops designed to enhance research in information access (IA) technologies including information retrieval (IR), crosslingual information retrieval (CLIR), automatic text summarization, question answering, text mining and so on by providing large-scale test collections and a forum for researchers...
متن کاملSupporting and Evaluating Whole-Session Interactive Information Retrieval
Information retrieval (IR) research and practice has traditionally been concerned with providing information seekers with a response to a request for information, and the evaluation of how good that response has been. As is recognized by this workshop, this “single-shot” approach to system support for information seeking, despite its successes, is inadequate in many ways as a model for support ...
متن کاملEuroWordNet: a multilingual database for information retrieval
The aim of the EuroWordNet-project is the development of a database with wordnets for English, Spanish, Dutch and Italian, similar to the Princeton WordNet1.5, which contains basic semantic relations between words in English. The Dutch, Italian and Spanish wordnets will be linked to the WordNet1.5 using equivalence relations. The resulting multilingual database can directly be used in (multi-li...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2013